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Abstract. We study the small time asymptotics of the gradient and Hes- 
sian of the logarithm of the heat kernel at the cut locus, giving, in principle, 
complete expansions for both quantities. We relate the leading terms of the 
expansions to the structure of the cut locus, especially to conjugacy, and we 
provide a probabilistic interpretation in terms of the Brownian bridge. In par- 
ticular, we show that the cut locus is the set of points where the Hessian blows 
up faster than l/t. We also study the distributional asymptotics and use them 
to compute the distributional Hessian of the energy function (that is, one-half 
the distance function squared). 



1. Introduction 

Let M be a compact, connected, smooth Ricmannian manifold of dimension n. 
For any point x £ M, we use Cut(a;) C M to denote the cut locus of x. In partic- 
ular, since M is compact, Cut (a;) will be nonempty for every x. The Riemannian 
metric induces a distance function dist(x, y). We will also need to consider the 
energy function, E(x,y) = | dist(ir, y) 2 . Let A be the Laplace-Beltrami operator 
on M, that is, if normal coordinates centered at a point p, then 

A = X^"=i ~§^? a ^ P- The heat kernel Pt{x,y) is the fundamental solution to the 
heat equation dtu(t,x) = ^Au(t,x). 

A well-known result of Varadhan states that t log pt(x, y) converges to —E(x, y) 
as t \ uniformly on all of M, Motivated by this result, we define 

Et(x,y) = -tlogp t (x,y) 

and observe that E t (x,y) — > E(x,y) uniformly. Malliavin and Stroock ^2] have 
shown, using pathspace methods, that away from the cut locus, spatial derivatives 
of E t {x,y) commute with taking the limit as t \ (for an analytic proof, see 
Clearly, the lack of differentiability of E(x, y) at the cut locus means that something 
else must be occurring there (see Bishop [S] for a brief discussion of the smoothness 
properties of the distance function at the cut locus). Indeed, in the same paper, 
Malliavin and Stroock use pathspace integration to show that, if the set of minimal 
geodesies connecting x and y is sufficiently "nice," then 'V 2 E t (x, y) is asymptotic to 
— l/t times the variance of a random variable on path space as t \ 0. Unfortunately, 
their analysis is too complicated to obtain more detailed information. 
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In the present paper, we develop an analogous, but purely finite-dimensional, ap- 
proach which allows a much more detailed analysis of the small-time asymptotics 
of the gradient and Hessian of E t (x,y). In particular, we will show how complete 
asymptotic expansions of the gradient and the Hessian of E t (x,y) can be repre- 
sented as an integrals over the set of midpoints of minimal geodesies from x to y. 
These general expansions are given in Theorems Eland 01 respectively. The leading 
term in each of these expansions, given in Equation is fairly accessible both 

to analysis and to a probabilistic interpretation, and we will show how the small 
time behavior of ~S7 2 E t {x,y) reflects the structure of minimal geodesies from x to 
y. In particular, we will show (see Theorem 1101 below) that V 2 E t (x,y), as a qua- 
dratic form on T y M, is unbounded as t \ if and only if y £ Cut(x). Finally, in 
Theorems II II and 1121 we show how the asymptotic expansion of V 2 E t (x,y) can be 
used to determine the distributional Hessian of E(x,y). 

We note that our methods are an extension of a procedure due to Molchanov |T5) 
of representing the heat kernel itself as an integral over the midpoints of minimal 
geodesies, allowing him to determine the rate of decay of the heat kernel at points 
y £ Cut (a;) in a variety of cases. 

A survey of the results in the present paper can be found in [16| . 

I would like to thank my advisor, Dan Stroock, for his invaluable suggestions 
throughout the course of this work. I also thank David Jerison and Joe Harris for 
helpful discussions. 



Our main tool will be a pair of formulas which express the gradient and the 
Hessian of E t (x, y) as integrals over the set of midpoints of minimal geodesies from 
x to y, the asymptotics of which are amenable to study. The present section is 
devoted to the derivation of these formulas. 

2.1. Preliminary results. We will need various facts about the small time asymp- 
totics of the heat kernel, which we present here. 

Let Cm C A'/xMbe the set of pairs of points (x, y) such that y £ Cut(a;). Away 
from the cut locus, we have the following asymptotic expansion of the heat kernel, 
due to Minakshisundaram and Pleijel j!4) (see [2J for a more modern development). 

Theorem 1. Let M be a smooth, complete Riemannian manifold of dimension n. 
Then there are smooth functions Hi(x,y) defined on (M x M)\Cm such that the 
asymptotic expansion 



holds uniformly as t \ on compact subsets of (M x M)\Cm- Further, if y = 
exp x (Y), then Ho(x,y) is given by the reciprocal of the square root of the Jacobian 
of exp^. at Y . 

For future use, let k(t,x,y) = (2nt) n ' 2 e E ( x,v '' t pt(x,y) be defined away from 
the cut locus, so that k(t,x,y) ~ IZi=o J/)**- Recall the result of Varadhan 
mentioned in introduction; namely that E t converges to E uniformly on all of M. 
We will rewrite this as 



2. The representation as Laplace integrals 




(1) 



Pt(x,y) = exp 



-E(x,y) +6(t,x,y) 
t 
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where S(t, x, y) is some function which goes to uniformly in t on all of M. Inci- 
dentally, one cannot hope to replace S(t, x, y) with a power series expansion. Even 
in the simplest case of the heat kernel on S 1 , we see that 5(t, x, y) fails to be 0(t) 
for any x and y. 

Having summarized the small-time asymptotics of the heat kernel itself both 
away from and on the cut locus, we now turn to the log derivatives. As mentioned, 
Malliavin and Stroock (for m — 1,2) and Stroock and Turetsky (for m > 2) 
have proved that 

(2) W m E t (x,y)^W m E(x,y) 

uniformly on compact subsets of Af\Cut(x), where V m is the mth covariant de- 
rivative and all derivatives are taken in the y variable. Next, we will need better 
control over the convergence of W m E t (x,y) away from the cut locus. 

Lemma 2. Let M be a smooth, complete Riemannian manifold of dimension n. 
Then there are smooth functions Gi(x,y) defined on (M x M)\Cm such that, for 
any positive integer m, the asymptotic expansion 

oo 

iV m \ogp t {x, y)~J2 V m Gi(*, y)f 
i=0 

holds uniformly as t \ on compact subsets of (M x M)\Cm- Further, the Gi 
are given in terms of the Hi by taking the log derivatives of the Minakshisundaram- 
Pleijel expansion of Theorem^ and in particular, Go(x,y) = —E(x,y). 

This lemma is a direct consequence of the fact that the Minakshisundaram-Pleijel 
expansion can be differentiated. For a probabilistic proof of this fact see [2] ; for an 
analytic proof see |3] . Note that we get a power series expansion only if we take at 
least one spatial derivative. Again for future use, let l(t, x, y, A) = t\7 A^ogpt(x,y) 
be defined away from the cut locus, so that it has the above expansion. 

Finally, we have a result of Stroock and Turetsky which states that 



(3) \V m Pt (x,y)\ < D n 



dist(x,y) 1 



Pt(x,y) 



t Vt. 

where the D m are some constants depending only on M. The point is that this 
estimate is valid even when y E Cut(x). Note that the results of Stroock and 
Malliavin mentioned above for m = 2, and the results of Stroock and Turetsky for 
m > 2, show that the power of t in Equation © is sharp. 

2.2. The gradient. In this section, we will prove a theorem describing the asymp- 
totic expansion of the gradient of E t (x,y) which is valid everywhere on M. We 
begin by introducing some notation. Fix any two distinct points x and y on M. 
Let r be the set of midpoints of minimal geodesies from x to y (for example, in 
the case M = S n with x and y the north and south poles, T is the equator). By 
compactness, there exists e > such that the e- neighborhood of T, denoted r e , 
is strictly positive distance from x, y, and both of their cut loci. Also, we define 
the hinged energy function h x ^ y (z) — E(x,z) + E(y,z). Note that h x>y obtains its 
minimum precisely on T and that this minimum is equal to E(x,y)/2. 

Theorem 3. Let M be a smooth, compact, connected Riemannian manifold. Choose 
any two distinct points x and y. Then there exist positive constants C and A such 
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that, for any A € T y M, we have 

V A E 2t (x,y) = - 



Jr e l(t,z,y,A)exp 


t 


k(t, x, z)k(t, y, z) dz 


Jr. ex P 


t 


k(t, x, z)k(t, y, z) dz 



+ e(t,x,y) 



where lit, x,y, A) andk(t,x,y) are as above and \e(t,x,y)\ < C exp (— X/t). 

Proof. Choose and fix some (x,y) £ M x M. Let F and T £ be as above with 
e small enough so that r c is a strictly positive distance from Cut (a;) and Cut(y). 
The Chapman-Kolmogorov equation gives 



P2t(x,y)=J pt(x,z)p t (z,y)dz + 
Using Theorem ^and Equation (J), we have 

fox,y (^) 



M\T e 



Pt(x,z)p t (z,y) dz. 



P2t(x,y) 



1 

2?rt 



exp 



k(t, x, z)k(t, y, z) dz 



exp 



M\r e 



- [h x ,y{z) + 6{t,x,yj) 



dz. 



Observe that h x , y (z) is continuous and achieves its minimum precisely on T. Hence 
its minimum on M\T e is strictly greater than its minimum on T e / 2 - It follows that 
there exist positive A and C such that 



(4) p 2t (x,y) = [l + e(t,x,y)} 



1 



2ttT 



exp 



hx,y (^) 



k(t, x, z)k(t, y, z) dz 



where |e(t, x,y)\ < Cexp(— X/t) (see Lemma 5.3.1 of Hsu's book ^U], where he 
gives essentially this result with a more detailed proof). 
Again use the Chapman-Kolmogorov equation to write 



P2t(x,y) 



p t {x 1 z)p t {z 1 y)dz. 



Taking derivatives (all derivatives are on the y variable) gives 

^AP2t(x,y)= / Pt(x,z)[V A p t (z,y)]dz. 

We divide the manifold into three regions. Let e > 0, T, and r e be as before. Let 
C e be an e-neighborhood around the cut locus of x. We now demand that e also 
be small enough that these sets are a strictly positive distance from one another. 
Finally, let R e = M\(C e U T e ) be the rest of M. Then 



^AP2t{x,y) 



p t (x,z) [V A p t (z,y)} dz 



Pt(x,z) [V A Pt(z,y)] dz 

Pt(x,z) [W A p t (z,y)] dz. 



On r e and R e we are a strictly positive distance from the cut locus. Also, note that 
V A p t (z, y) = p t (z, y)V A logp t (z, y) 



HEAT KERNEL AT CUT LOCUS 



5 



Then we have 
V A p 2 t(x,y) ■ 



r,ufl 8 



7'(M,y) ( ^ ) exp 



fe(t, x, z)k(t, y, z) dz 
Pt(x,z) [V A p t {z,y)\ dz. 



So now the problem is to control the last integral. For this we use Equations JTJ 
and 1(3}. This gives us the bound 



Pt(x,z) [V A Pt{z,y)] dz 



< 



diam(M) 1 
~7t 



t 



exp 



-- (K, y (z) + 6(t,x,y)) 



dz. 



To get the log gradient, we need to divide through by p2t{x,y), which we write as 
an integral using Equation We now claim that the terms involving R e and C' e 
vanish exponentially fast. Indeed, we've already seen this during the derivation of 
Equation Q. Thus we have 



V A \ogp 2t (x,y) = 



Jr e l(t,z,y) exp 




y( z ) 
/ 


k(t, x, z)k(t, y, z) dz 


/r e ex P 


^a: , y ( % ) 
t 


k(t, x, z)k(t, y, z) dz 



+ e(t,x,y) 



where e(t, x, y) and e(£, x, y) are both bounded in absolute value by C exp (— A/T), 
for some positive C and A (perhaps different from above). Further, we have that 
1/(1 + e(t,x,y)) = 1 + 0(e(t,x, y)). Hence, by adjusting e(t, x, y), C, and A, we 
can get rid of e(T,x,y). Recalling the definition of E t (x,y), we have proved the 
theorem. □ 

2.3. The Hessian. Here we develop the analogous formula for the Hessian. By 
polarization, it is sufficient to consider ~S/ 2 A A E2t{x,y). 

Theorem 4. Let M be a smooth, compact, connected Riemannian manifold. Choose 
any two distinct points x and y. Then there exist positive constants C and A (pos- 
sibly different from the constants in Theorem^) such that, for any A € T y M , we 
have 



V\ A E 2t {x,y) 



Jr e G(M,y,,4)) 2 exp 




fax ,y \%) 
t 


k(t, x, z)k(t, y, z) dz 


Jr e ex P 


fax , y ( 

t 




k (^fcf x , 


z)k(t, y, z) dz 



Jr e l{t,z,y,A)exp 


t 


k{t, x, z)k(t, y, z) dz 


ir e ex P 


t 


k(t, x, z)k(t, y, z) dz 



Jp £ V A l{t, z,y,A) exp 




t 


k(t, x, z)k(t, y, z) dz 


Jr. ex P 


fax , y (^0 

t 


k (t, x , 


z)k(t, y, z) dz 



+ e(T,x,y) 



where l(t,x,y, A) and k(t,x,y) are as above and \e(t,x,y)\ < C exp (— X/t). 
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Proof. We begin by observing that 



(5) Vl A log/ 



/ 



(V^log/) 2 



Since we have computed the log gradient of the heat kernel in our proof of Theo- 
rem|31 the only thing remaining is for us to compute the Hessian of the heat kernel. 
Again we start with the Chapman-Kolmogorov equation and differentiate under 
the integral sign to get 



^A,AP2t(x,y) 



p t (x,z)V AA p t (z,y) dz 



r E ui? £ 



v 1a \ogp t {z, y) + (log p t {z, y)) p t (z, y)p t (z, x) dz 



Pt(z,x)V 2 A A p t (z,y) dz 



r t uR c 

1 
2vrf 



~V A l(t,z,y,A) + ^(l(t,z,y,A)) 2 



exp 



h>x,y(z} 



k(t, x, z)k(t, y, z) dz 



Pt(z,x)V 2 A A p t (z,y) dz. 



We wish to divide both sides by P2t(x,y), since that's what appears in the ex- 
pansion of the log Hessian. We can use Equation to control the integral over 
C e (in particular, it decays exponentially), and the integral over R e also decays 
exponentially, for the same reasons as before. Thus (refer to Theorem |3J| 

V% A p 2 t(x,y) _ 
P2t(x,y) 



1 J 


1 Jr. ^Al{t,z,y,A)exp 


t 


k(t, x, z)k(t, y, z) dz 


1 + e(t,x,y) 


K 1 Sr. ex P 


t 


k(t, x, z)k(t, y, z) dz 



1 Jr. W,z,y,A)f exp 


t 


k(t, x, z)k(t, y, z) dz 


Jr £ ex P 


hx , y 1 

t 




k(t, x, z)k(t, y, z) dz 



+ e(t,x,y) 



where we change C and A as necessary. 

We now plug all of our results from above into Equation |JSJ. The theorem then 
follows. □ 

These theorems work well when we wish to compute the asymptotics of the log 
gradient or log Hessian with respect to fixed x and y. However, if we wish to study 
how the asymptotics change as y moves, say into the cut locus, then they won't be of 
much use. This is because T e and A can change discontinuously in y, which we can 
see just by looking at § 2 and letting y move into Cut(a;). Fortunately, our derivation 
of the theorem makes it clear how to solve this problem. Let O be the union of 
the sets T associated to every y € Cut(x). Then O is still a uniformly positive 
distance from Cut(x). Let O e be an e-neighborhood, chosen small enough to still 
be a uniformly positive distance from Cut(cc). Then if we choose y € B e (Cut(x)), 
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everything we've done above works with T and T e replaced by O and O e . This 
will allow us to reduce all questions of what happens when y is moved to studying 
how our integral operators change. Said informally, all we've done is take all the 
parts of R which will become relevant as we move y and make them part of 0, our 
new region of interest. Thus all of the important behavior takes places in O. This 
modification is worth formalizing. 

Corrolary 5. Let M and x be as above. Then for any y S B e (Cut(x)), the expan- 
sions in Theorem^ and Theorem^hold with T e replaced throughout by O e . 

3. The leading terms 

Theorems and 0] give, in principle, the complete asymptotic expansions of the 
gradient and Hessian of E t (x 7 y), up to terms which vanish faster than any power 
of t. However, if we restrict our attention to the first few terms of these expansions, 
the formulas simplify considerably. 

3.1. Formulation in terms of expectation and variance. Considering the 
leading terms for the gradient and Hessian of E t (x,y), we have 

V A E t {x,y) = 

f Jr E VA-B(z,y)exp [-\h x , y {z)] H (x, z)H (y, z) dz 
y Jp^ exp [-f h x , y (z)] H (x, z)H (y, z) dz 

and 

V% A E t (x,y) = 

4 fir, (^AE(z,y)) 2 exp [-jh x>y (z)] H (x, z)H (y, z) dz 
t | J Ft exp [-- h X} y(z)] H (x, z)H a (y, z) dz 

/ rt VaE(z, y) exp [-jh x . y (z)] H {x,z)H (y,z)dz 
It, ex P [~l h x,y( z )] H o(x, z)H (y, z) dz 

Note that we're now looking at E t (x,y) at time t rather than at time 2t. 

To begin, we can give a probabilistic interpretation of the constant term of the 
gradient and the 1 jt term of the Hessian. Let 

^(dz)^^^H(x,z)H(y,z)ex P ^-^^j dz 

f f 2/i iz\ \ 
where Z t = J H(x, z)H(y, z) exp ( \ dz. 

Then /i t is a probability measure supported on T e , and the above becomes 
V A Et(x, y) = 2Ef* [V A E(z, y)} + 0(t) 

and V% A E t (x,y) = -^Var^ [V A E(z, y)\ + 0(1). 

Since T e is compact, so is the space of probability measures supported on r e (in 
the weak topology). In particular, if we take any sequence of times decreasing to 0, 
then it will have a subsequence U such that /i ti converges to some limit probability 
measure \i supported on T. Let M be the set of all such limit measures. For 




0(1). 
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any /i 6 Mo, we will say that U is an associated sequence of times if /i^ converges 
(weakly) to \x. 
We know that 

W A E(z, y) = dist(z, y) (A, V dist(z, y)) 

= -tist(x,y) (A,Y(z)) 

for z £ r, where l^(-z) is the (unique) unit vector at y such that 

exp y [-Y(z) dist(z, y)] = z. 

Further, if we let 0a (2) be the angle between A and Y(z), then we can write 
(A, Y(z)) as |A| cos 9a(z). Thus, if we choose any /x £ Mq and let be an associated 
sequence of times, we have 

lim V A E ti (x,y) = \A\ dist(x, y)E' i [cos6U(z)] 
and lim UV 2 AA E u (x,y) = -\A\ 2 dist(x, yf Var M [cos6U(z)] ■ 

i — >oo ' 

Some elementary facts about the log gradient and log Hessian follow immediately. 
If we homothetically scale M by a factor of a > 0, then lim^oo V A E ti (x, y) is 
multiplied by a and lim^oo a^u( x iU) ^ s multiplied by a 2 . Also, we have the 
pair of inequalities 

— \ A\ dist(x, y) < liminf S/ A Et{x, y) < lim sup V A E t (x, y) < \A\ dist(x, y) 
and - \A\ 2 dist(i, yf < liminf tS/ A A E t (x,y) < limsuptV^ A E t (x, y) < 0. 

3.2. Relation to path-space integration. There is a one-to-one correspondence 
between T and the set of minimal geodesies from x to y. This suggests that we 
think of z £ T as parameterizing these minimal geodesies, of any /x £ Mq as a 
measure on the set of minimal geodesies, and of cosd A (z) as a function on the 
minimal geodesies (in particular, cos A (z) is the cosine of the angle between A and 
the geodesic corresponding to z). This viewpoint can be fleshed out by considering 
the Brownian bridge. (Intuitively, the Brownian bridge from x to y at time t is the 
stochastic process obtained from Brownian motion by conditioning on the particle 
starting at x and being at y at time i; for a more detailed discussion see, for 
example, Hsu's book JO].) In particular, fix x and y, and let P* be the measure on 
path-space corresponding to the Brownian bridge from x to y at time t. It is well- 
known that Pt exists and that its finite marginal distributions are given in terms 
of the heat kernel. To be precise, let X T be the map from path-space to M which 
sends each path to its position at time r £ [0, t]. Then for any finite sequence of 
times = tq < ■ ■ ■ < r m < r m+ i — t, the joint distribution of X Tl , . . . , X Tm under 
Pt is 

7 V I I P^t+i— Tt O^is 

Pt{x,y) fj^ 

where, of course, xq = x and x m+ i = y. Now consider X t / 2 , and let v t be the 
distribution of X t / 2 under P t . Then the density of v t with respect to the volume 
measure on M is 

dv t _ Pt/2(x,z)p t/2 (z,y) 
rfvol Pt(x,y) 
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In order to study the limiting behavior, we integrate against a smooth test function 
(f(z). Also, we use Equation J3J in the denominator to get 

J M f( z )Pt/2(x, z)p t/2 (z, y) dz 



E 1 " Mz)} = 



[1 + e(t, x, y)] f T (±) exp [-jh XiV (z)] k(t/2, x, z)k(t/2, y, z) dz 



The contribution from the integral over M\T e in the numerator vanishes expo- 
nentially, and on r e we can use the Pleijel expansion. Proceeding as before, we 
conclude that 

™,, r ,M Ir c < Piz)exp[-^h Xt y(z)]H {x,z)Ho{y,z)dz 
Jp^ exp [-jh x ^{z)\ H (x,z)H a (y,z)dz 

It follows that fiti — * fi> if and only if v ti — ► /i. So not only can we think of /i as a 
measure on the set of minimal geodesies, but we can also say that it is the natural 
such measure induced by the Brownian bridge. 

The preceding allows us to view our method as a finite dimensional analogue of 
that used by Stroock and Malliavin, in which they work directly with the measure P t 
on pathspace. According to the heuristics of Feynman-type path integrals, Wiener 
measure should be thought of as the probability measure on pathspace given by 
weighting each path w(t) by a weight proportional to 



cxp^VMI 2 ^) 



(even though this is not possible in a rigorous sense) . If we consider paths from x to 
y at time t, then as t \ the above indicates that Pt, which is Wiener measure con- 
ditioned to require that the path be at y at time t, should be concentrating on paths 
that minimize energy These paths are, of course, the minimal geodesies joining x 
and y. Thus in the limit, we expect the infinite dimensional pathspace picture to 
collapse down to a finite dimensional picture on minimal geodesies. Understanding 
this collapse on pathspace is somewhat difficult, but the present approach avoids 
this by working directly on the manifold from the beginning. 

We will study the limiting measure (or measures) almost entirely in the context 
of geometric analysis, only occasionally remarking on the probabilistic interpreta- 
tion. Nonetheless, the probabilistically inclined reader is encouraged to think about 
the limiting measure /i (when it exists) as giving the probability that a Brownian 
particle travelling from x to y "instantaneously" does so via a particular minimal 
geodesic. 

3.3. An example. One application of Theorem 0] is the explicit computation of 
the asymptotics of \7 2 E t (x,y) when y G Cut(x). (Of course, one expects such an 
explicit computation to be possible only in special cases.) Here we show how this 
can be done on § n . For simplicity, assume that n > 2. We note that y € Cut(a;) if 
and only if x and y are antipodal points. Hence, without loss of generality, we take 
x and y to be the north and south poles, which we denote N and S. Then T is the 
equatorial sphere § n_1 (l). (By § n (r), we mean the standard n-dimensional sphere 
of radius r.) By symmetry, we see that fj, t converges to the uniform probability 
measure on the equatorial sphere (with respect to the induced volume measure). 
Next, let A be any unit vector in T y M (it doesn't matter which one, again by 
symmetry). Decomposing the equatorial sphere into level sets of Oa(z), we see that 
the level set for any 9 is S n ~ 2 (sin6>). 
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We know that the gradient of Et (N, S) is zero by symmetry. Thus we proceed to 
computing the Hessian. Let uj m denote the volume of the unit sphere of dimension 
to. We have 

[cos 2 9 A (z)] = — !— / (J cos 0) 2 (sin 9) n - 2 u n _ 2 d6 

and E»[cos0 A (z)} 2 = (^^j ^ cos 9(siri 9) n - 2 d9 

This second term vanishes because cos# is anti-symmetric about tt/2 while sin# is 
symmetric. Using this in our formula for the Hessian gives 

2 pit 

limt[V AA E t {N,S)] = U)n ^ / (cos 6) 2 (sin 9) n - 2 d6 

*V> ' Wn-l Je=0 

n 

We conclude that V 2 AA E t (N, S) ^\A\ 2 as t \ for any A e T S M (the above 

computation assumes that n > 2, but this formula extends to the case n = 1 as can 
be checked easily by hand). 

3.4. The relation to conjugate points. In the case of S™, we were able to de- 
termine the limiting measure /i using only symmetry considerations. In general, 
this won't be so easy, and the limiting measures (or measures) will depend on the 
behavior of h x<y near its minima. In particular, the limiting measures will be gov- 
erned by whether or not these minima are degenerate (in the sense of Morse theory, 
that is, whether or not the Hessian is positive definite) and, if so, how degenerate 
they are. However, before discussing the relationship between degeneracy and the 
asymptotics of /i t , we wish to relate this degeneracy to the geodesic geometry of 
the manifold. 

We begin by introducing some terminology Given a smooth, real- valued function 
/ which is defined in a neighborhood of the origin in K™ and a non-negative integer 
to, we will say / is constant to exactly order to at the origin in the direction £ G §™ _1 
if 

(d t y [/(o-/(o)] t=0 

is zero for < i < m but is non-zero for i = m + 1. In particular, / is constant to 
exactly order in the direction £ if its first derivative in the direction £ is non-zero, 
/ is constant to exactly order 1 if its first derivative is zero but not its second, and 
so on. We will say that / is constant to finite order in the direction £ if there exists 
some non-negative integer to with / constant to exactly order m, and we will say 
that / is constant to order at least to in the direction £ if the derivatives above 
vanish for < i < to. 

Now, let 7 be a minimal geodesic from x to y, and take (r, 6\, . . . , 9 n -\) to 
be a polar coordinate system on T X M such that 7(7") = exp x (r, 0, . . . , 0) for r e 
[0, dist(x, y)\. We then say that 7 is conjugate to exactly order to in the direction 
£ £ §™~ 2 if 9 ~~> cxp 2 .(dist(a;, y), 9) is constant to exactly order to in the direction £ 
(here §™~ 2 is the unit tangent space in the tangent space to §™ _1 at 9 = and thus 
the Jacobi field induced by £ is perpendicular to 7). In particular, 7 is conjugate 
to exactly order in the direction £ if it is not conjugate, in the usual sense, in 
this direction, and 7 is conjugate to some positive order if it is conjugate in the 
usual sense, with the order of conjugacy indicating how many derivatives of the 
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exponential map vanish in that direction. We will use the terms conjugate to finite 
order and conjugate to order at least m analogously to the above. 

The relationship between the degeneracy of the minima of h XtV and the conjugacy 
of the corresponding geodesies is contained in the following lemma. 

Lemma 6. Choose distinct points x and y on M and let d — dist(a;,y). Let 
(r, 0i, . . . , n -i) and 7 be as above, and choose some £ G §"~ 2 . The coordinates 
(r, 0\, . . . , n -i) on the tangent space induce coordinates in a neighborhood 0/7 
under the exponential map, and thus we can also ask to what order h XiV is constant 
at exp(d/2, 0, . . . , 0) in the direction £. Then h XjV is constant to finite order in the 
direction £ if and only if 7 is conjugate to finite order in the direction £. In this 
case, there exists a non-negative integer m such that h XiV is constant to exactly 
order 2m + 1 at cxp(d/2, 0, . . . , 0) in the direction £ and 7 is conjugate to exactly 
order 2m in the direction £. 

Proof. Let tp(t) be the angle between the geodesic from exp(d/2, t£) to exp(d, t£) 
and the geodesic from exp(d/2,t£) to exp(d, 0) = y. Let 

p(t) = dist(exp(d,t£),2/). 

Then, considering the dependence of geodesies on their starting points in the tan- 
gent bundle, we see that ip{t) and p(t) are comparable for small t. Thus, ip(t) is 
constant to exactly order I at (in the direction d t ) if and only p{t) is constant to 
exactly order / at (in the direction dt). From the definition of p(t), it's clear that 
p(t) is constant to exactly order I at if and only 7 is conjugate to exactly order I 
in the direction £. 

The distance from x to exp(d/2, i£) is independent of t, and the vector field 
given by pushing £ forward by the exponential map is always perpendicular to 
the geodesic from x to exp(d/2,t£). Thus, /i Xi!/ (exp(d/2, depends only on the 
distance between exp(d/2, t£) and y. Further, we see that the inner product between 
the push-forward of £ and the unit tangent to the geodesic from exp(d/2, i£) to y at 
exp(e?/2,t£) is constant to exactly order I (in the direction d t at t = 0) if and only 
if ip(t) is constant to exactly order I. Since this inner product is proportional to the 
derivative of /i Xi!/ (exp(d/2, t£)) with respect to t, it follows that ip(t) is constant to 
exactly order I if and only if h x y is constant to exactly order I + 1 in the direction 

£■ 

Combining these facts, we see that 7 is conjugate to exactly order I in the 
direction £ if and only if h XiV is constant to exactly order I + 1 in the direction 
£. The lemma will be proved once we determine that h Xty can only vanish to 
odd order. This, however, is just a restatement of the fact that the first non-zero 
derivative of h XiV in any direction must be even because h XyV has a local minimum 
at exp(d/2,t£). □ 

In particular this lemma implies that, if z € T, then z is a non-degenerate 
minimum of h XiV if and only if x and y are conjugate to exactly order in all 
directions, which, as mentioned, is the same as saying that x and y are not conjugate 
along 7 in the usual sense. On the other hand, if z is a degenerate minimum, then 
x and y are conjugate (in the usual sense) along the minimal geodesic through z, 
and furthermore, the index and orders of conjugacy 1 can be determined from the 
partial derivatives of h XiV . 



Some authors use the order of a conjugate point to denote the dimension of the null space of 
the differential of the exponential map. We will, taking our cue from Morse theory, instead call 
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3.5. Laplace asymptotics and the resolution of singularities. Having given 
a geometric interpretation of the minima of h Xt y, we now turn to the investigation 
of the relationship between the minima of h XtV and the limiting measures Mq. To 
do so, we will need to delve into the theory of Laplace asymptotics. We begin by 
considering the case when T consists of finitely many points, say z\, ■ ■ ■ , z m . For 
ease of notation, we write g{z) = 2h x ,y(z) — E(x,y); hence g is non-negative and 
has zeroes precisely at the Zj. By taking e small enough, we can ensure that r e is 
the union of the disjoint balls B e (zi), where i ranges from 1 to m. In this case, we 
see have that 



1 m r 

where Z(t) = 



ip(z)Ho(x,z)H (y,z)e- 9(z ^ t dz 



8=1 



H {x,z)H () {y,z)e-^)/ t dz. 



We are thus lead to study integrals of the form 



(7) / i P {z)e- g ^l t dz 

as t \ 0. 

First, suppose that g can be diagonalized at z%, that is, suppose that we can find 
coordinates tii, . . . , u n around Zi such that 



(8) 



/(til, ...,««) 



for some positive integers k\ < ■ ■ ■ < k n . Of course, at a non-degenerate minimum, 
the Morse Lemma guarantees the existence of such coordinates with kj = 1 for 
each j, but in general this need not be true. Under the assumption that (JSJ holds, 
Estrada and Kanwal give the full asymptotic expansion of Q ; namely, 



(9) 



ip(ui, ■ ■ ■ ,u„)exp 



B c ( Zi ) 



En 
.7 = 1 



n 



E 



dui ■ ■ ■ du r , 

( 2v+l\ 

\ 3 / 4- 2k^ 



^ (2ii)!fci 



d 
duj 



¥>(0,...,0) 



where derivatives on the right-hand side are to be understood as giving partial 
differential operators which are then applied to ip and evaluated at the origin (that 
is, at Zi). We will be interested mainly in the first term (that is, the one coming 
from ij = for all j), in which case Equation gives 



(10) 



B e (z 4 ) 



^{z)e-^l l dz = tV2fcM+-+i/2fc».i L vol^Mzi) + O ( < 1/fc - 



where vol u is the volume element in the u coordinate chart and Ci is a positive 
constant which depends only on n and the kj^s. 

Equation l(TU|l implies that the limit is dominated by those geodesies which are 
"the most conjugate," in the sense of being conjugate in many directions and/or 



that the index of the conjugate point and reserve the term order to denote the order to which the 
differential of the exponential map vanishes in some direction. 
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to high order. More precisely, assume that g can be diagonalized around each of 
its minima. Then to each z% we can associate the order of the leading term of 
the integral over B e (zi), which is U = l/2k\,i + ■ • • + l/2k n> i. Then, as t \ 0, fit 
converges to a limit fi which is supported on those z% with the smallest leading order 
(that is, Zi with U = min{Zi, . . . , l m })- Further, the mass at these points is given by 
the coefficient of the leading term of the expansion coming from (I10II . normalized 
to have total mass one. In terms of the Brownian bridge, this says that that the 
particle prefers to travel along the most conjugate geodesies. More precisely, if we 
require the particle to travel from x to y "instantaneously," then it travels along 
the geodesic through Zi with probability fi{zi\. To be a bit more intuitive, it is not 
be surprising that a particle under Brownian motion prefers conjugate geodesies, 
since conjugacy should make a geodesic more forgiving of the white noise which the 
particle experiences as it tries to follow the geodesic. 

In order to extend our analysis beyond the diagonalizable case, we will need to 
introduce resolutions of singularities. Let X be a real-analytic manifold of dimen- 
sion n (we do not require that X be compact), and let / be a real-analytic function 
on X which is not identically zero. Then, for our purposes, a resolution of singu- 
larities will mean an n-dimensional analytic manifold Y and a proper, surjective 
analytic map £ : Y — ► X such that £ possesses the following two properties: 

• At each point of the pre-image of the zero level-set of /, there exist local 
coordinates with respect to which / o £ is a monomial and the Jacobian of 
£ is a monomial times a non-zero smooth function; 

• The function £ is a diffeomorphism outside of the zero level-set of /. 

A famous theorem of Hironaka states that, for X and / as above, a resolution of 
singularities always exists. 

Remark. The original results of Hironaka are both much more general than 
what we have stated above and formulated in the language of schemes. A similar 
definition to the above is given in Arnold, Gusein-Zade, and Varchenko's book pQ, 
which is also concerned with the evaluation of Laplace asymptotics. An "elemen- 
tary," constructive proof in the case of characteristic zero (which includes the case 
of real-analytic manifolds) is given by Bierstone and Milman 0]. Finally, a purely 
analytic statement and proof of a weaker form of the resolution of singularities for 
real-analytic manifolds (in particular, without the property that (p is a diffeomor- 
phism off of the zero level-set) is given by Sussmann |20j. 

As a first application of the resolution of singularities, we can prove the following 
theorem. 

Theorem 7. Let M be a real-analytic, compact Riemannian manifold. Then for 
any distinct points x and y of M, the corresponding sequence of measures fit (see 
equation ^) described above converges to a unique limit fi as t \ 0. 

Proof. That M is real-analytic implies that the function g defined above is real- 
analytic in a neighborhood of T, and thus for an appropriate choice of e, on r e . 
Let if be any smooth function with compact support in r e . It is enough to show 
that E Mt [<p] converges as t \ 0, since the limit of fit is determined by its action on 
smooth test functions. 
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Following P], we define an elementary Laplace integral over a bounded, open 
neighborhood of € M™ as an integral of the form 

cxp x* 1 ■ ■ ■ x^ n /tj x l i ■ ■ ■ x l ™ ip(xi, . . . ,x n ) dxi . . . dx n 



u 

where the ki are non-negative, even integers, at least one of which is non-zero, the 
rrii are non-negative integers, and ip is a smooth function of compact support in 
U (we call ip the amplitude function). By Theorem 7.4 of PP, for any elementary 
Laplace integral there exists a distribution a, a positive rational number a, and a 
non-negative integer m such that this integral is asymptotic to a(tp)t a \ log t\ m as 
t \ 0. Further, if -0 is non-negative on U and positive at the origin, then a(ip) is 
positive. 

We now apply the resolution of singularities to g and T £ (where we think of r £ 
as a non-compact real-analytic manifold). Thus we have a real-analytic manifold 
Y and a map £ : Y — > T e with the properties listed above. Because V (which is 
the zero level-set of g) has measure zero and £ is a diffeomorphism elsewhere, it 
follows that an integral over T e can be written as an integral over Y by pulling back 
everything on T £ \ T (and ignoring the preimage of T in Y, which has measure zero 
under the pull-back of the volume form). In particular, we have that 

Z(t) = J (H (x, •) o 0(u)(H (y, ■) o £)(«) exp (-g o £(«)/*) f (dvo\)(u). 

Next, observe that restricting the integral to the preimage of the closed e/2 neigh- 
borhood of r only introduces an exponentially small error; and further, the same is 
true if we integrate over any set intermediate between an e/2 neighborhood and T e . 
Since this closed neighborhood is compact, so is its preimage under £. Hence it can 
be covered by finitely many coordinate charts (each of which is also contained in 
the preimage of r e ) of the type described above (namely, g is a monomial and the 
Jacobian of £ is a monomial times a smooth, non-zero function). Using a partition 
of unity subordinate to this cover, we can write Z(t) as a sum of finitely many, 
say I, elementary Laplace integrals (up to exponentially small error). In addition, 
we see that the amplitude function ipi in each of these integrals is non-negative 
and positive at the origin (in the local coordinates) because H , the volume form, 
and the partition functions have these properties. Thus, we see that the integrals 
over these charts are asymptotic to ai(ipi)t ai \ logt\ nii respectively, for i = 0, . . . , I, 
where the aj(V'i) are all positive. Picking out the dominant such term or terms, we 
conclude that there is some positive real number c, some positive rational number 
a, and some non-negative integer m such that 

Z{t) ~ct a \logt\ m ast\0. 

In order to determine the numerator in E Mt [<p], we follow the same procedure. 
The only difference is that now the amplitude functions in the elementary Laplace 
integrals also include the pull-back of the test function j^asa factor. In particular, 
we have the same coordinate charts as before, and now our amplitude functions are 
given by ipi = ipi{<-p o £). We can no longer guarantee that the ipi are non- negative 
and positive at the origins of our charts, and thus we cannot guarantee that a;^) 
is non-zero. However, note that we need only concern ourselves with the charts 
where a.i = a and m t = m. If the sum of the a,i(tpi) corresponding to these charts 
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is non-zero, then call it c and observe that 

J ip{z)H (x,z)H () {y,z)e~ 9 ^/ t dz^ct a \\ogt\ m as t \ 0. 

On the other hand, if the sum of these di{(pi) is zero, we cannot necessarily deter- 
mine the leading term of the expansion for this integral; however, we can assert that 
this integral is o(t a \ \ogt\ m ). In either case, dividing by Z{t) and letting t \ shows 
that [ip] converges to some finite limit. As mentioned above, this completes the 
proof. □ 

Note that this result says nothing about how to determine the limit /i, or more 
generally, about how it relates to the geodesic geometry of M. We suspect, although 
we have not proven, that if M is only assumed to be smooth, the limiting measure 
need not be unique. 

3.6. Newton polyhedra and evaluation of the limit. Our use of the resolution 
of singularities in the last section was non-constructive. In the present section, we 
discuss how additional assumptions about g allow one to say much more about the 
limit measure \i. 

We return to the case where M is smooth and g has finitely many minima, which 
we denote by Zi, although now we drop the assumption that g can be diagonalized 
near the Zj. In a series of papers (for example |2U) which culminate in the mono- 
graph pQ, Arnold and his school have provided a fairly complete analysis of the 
asymptotic expansion of equation J2J in this case. We summarize the needed re- 
sults. First, we need to assume that g vanishes to finite order at z%. This is always 
true in the real-analytic category, but in the smooth category it need not be the 
case. Given that g vanishes to finite order at Zi, we can in fact assume that g is 
real-analytic in a neighborhood of g by taking an appropriate change of coordinates. 
In particular, there exist coordinates Uj around Zi such that g is equal to its Taylor 
expansion in these coordinates. 

Before stating the results, we need to introduce some notation. Let Z™ be the set 
of all n-tuples of non-negative integers. Also, we will use the multi-index notation, 
so that, for a = (eti, . . . , a n ) € 1I\ we interpret u a as the monomial u" 1 ■ ■ ■ u" n . 
Then we have that 

9= 9 a u a 

aGZ™ 

where the are the coefficients in the Taylor expansion. Now let R" be the 

set of all n-tuples of non-negative reals, and consider Z™ to be a subset of R™ in 
the natural way. Then the Newton polytope of g is defined to be the subset of 
determined by taking the convex hull of U(a + R" ) where the union is over 
all a € Z™ such that g a ^ 0. Further, the Newton diagram of g, which we will 
denote by A(g), is defined to be the union of the compact faces of the Newton 
polytope. (Strictly speaking, both the Newton polytope and Newton diagram of g 
also depend on our choice of coordinates Ui, but for now we will simply consider 
the coordinates to be given.) 

Let 7 be a face of A(g). Then we define g 1 to be the power series consisting of 
the monomials whose multi-indices lie on 7 with the same coefficients as appear in 
the power series of g. That is, 

■97 = 9 a u a . 
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Note that if 7 is compact, then g 7 is a polynomial. Similarly, we define the principal 
part of g, denoted <?a, by 

9 A = 51 9aVja - 

aGA(g)nZ™ 

We say that g is non-degenerate if for every compact face 7 of A(g), the polynomials 

dg 1 dg 1 
dx\ ' ' dx n 

have no common zeroes in (K \ 0)™. We now take a closer look at A(g). Consider 
the ray (r, . . . , r) for r > 0. This ray intersects A(g) in exactly one point, say 
(tq, . . . , ro). We define p(A(g)) = 1/ro to be the remoteness of the Newton diagram 
of g. Now let k(A(g)) be the number of degrees of freedom of the supporting 
hyperplane of A(g) at the point (r , . . . , r ); we call fc(A(<?)) the multiplicity of the 
Newton diagram of g. 

We are now in a position to state the main result. Combining Theorems 7.6 
and 8.6 of and Theorem 2.6 of we have the following. 

Theorem 8. Let g be an analytic function defined in a closed neighborhood B e (0) C 
R n of the origin with a unique minimum of finite order at the origin. Assume that 
g(0) = 0. Let (p be a smooth function on B e (0) such that ip(0) 7^ 0. Then there 
exist an integer m between and n — 1 inclusive, a positive rational number a, and 
a positive real number c, all depending only on g, such that 

[ i P (z)e- s( - z ^ t dz = ap(0)t a I log*!" 1 + 0{t a \ log^™- 1 ) 

as t \ 0. Consider the Newton diagram of g and its remoteness and multiplicity, 
as defined above. Then 

t a \\ogt\ m > t p{Ai9)) \\ogt\ k( - A ^ 

for small enough t. Finally, if g is non- degenerate, then in fact a — p(A(g)) and 
m = k(A(g)). 

While we won't prove this theorem, we will comment on its proof and application. 
As might be expected, the resolution of singularities plays an important role. In 
fact, the central observation is that in the non-degenerate case one can construct a 
resolution of singularities from the Newton polytope such that the leading term has 
the properties given in the theorem. As such, the above represents a special case 
of the results of the preceding section. A more direct approach is taken in [22], in 
which the lead term of the integral is computed directly under the assumption of 
non-degeneracy. The approach taken in |22| also proves the one-sided estimate on 
the lead term given in the theorem. On the other hand, this approach fails to show 
that the Laplace integral has an expansion of the desired kind in the degenerate 
case. 

It's clear that Theorem [S] provides us with more precise information about 
Laplace integrals when the function g is non-degenerate. For this reason, it is 
helpful to know that degeneracy is rare, in some sense. In particular, Lemma 6.1 
of pQ asserts that, given a Newton diagram, the set of degenerate principal parts 
is a proper semi-algebraic subset of the space of all principal parts corresponding 
to the given diagram, the complement of which is everywhere dense. Approaching 
the issue of degeneracy from a different direction, we could ask to what extent a 
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change of coordinates might help. Recall that the Newton diagram of g and its 
associated features, including degeneracy, depend on the chosen coordinates. It is 
easy to come up with functions which are degenerate in one coordinate system, but 
which are non-degenerate in another. Unfortunately, trying to eliminate degener- 
acy by changing coordinates does not work in general; there exist functions which 
are degenerate in any coordinate system. 2 

We now return to the problem at hand, namely determining the limiting behavior 
of If we assume that g vanishes to finite order at each of the points Zi, then we 
can apply Theorem [S] to conclude that around each z< the Laplace integral (JJJ is 
asymptotic to a constant times t ai \ logi| m \ Since Z(t) is the sum of these terms, 
we see that \Xt converges to a limiting measure fi which is supported on those Zi 
with dominant leading term. 

So far we've been assuming that T consists of a finite number of points. In 
general, though, T can be quite complex. While we are far from a result which covers 
all possibilities, we can generalize the preceding a bit more. In particular, suppose 
that r consists of a finite collection of isolated, smooth submanifolds (possibly with 
boundary) Ni, . . . , N m of M. Then the integral over r e can be decomposed into 
integrals over the e-neighborhoods (Ni) e of the individual Ni. Further, the integral 
over each (AT i ) e can be written as an integral in the normal direction followed by an 
integral in the tangent direction. Hence, if g vanishes to finite order in the normal 
directions, we can treat the integral in the normal direction at each z £ N by 
the above methods. In particular, if we assume that for each Ni the asymptotics 
of the integral in the normal direction has the same leading term t ai | logi| OT< at 
all points, then the integral over (Ni) e will have leading term t ai \ logt| mi , and jttj 
will converge to a measure [i which is supported on the union of those N's with 
dominant leading term. Further, on each such Ni, [i will be absolutely continuous 
with respect to the induced volume measure on iV,-. 

While this analysis is fairly general, it certainly does not cover every case. One 
can easily come up with cases where, for example, T has accumulation points or g 
vanishes to infinite order, and where determining the limiting behavior of fi t would 
be quite difficult, if not impossible. 

In a different vein, we can also use Theorem [S] to prove the following lemma, 
which we will need later. 

Lemma 9. Let M be a smooth, compact, Riemannian manifold, and let x and y be 
points in M such that y € Cut(x). Suppose there is a point w GT and an associated 
sequence of times t% decreasing to zero such that ^L ti converges to the point mass at 
w as i —>■ oo. Then ~^7 2 h x ^ y {w) is degenerate. 

Proof. Let 7 be the minimal geodesic from x to y passing through w (so that w 
is the midpoint of 7). It follows from LemmaElthat V 2 h Xty (w) is degenerate if and 
only if 7 is conjugate. For the remainder of the proof, we assume that \7 2 h Xty (w) 
is non-degenerate. Hence 7 is not conjugate, and because y € Cut(x), there must 
be at least one other minimal geodesic from x to y. Let 7 be such a geodesic and 
let w be its midpoint. Then, because ~S/ 2 h x ^ y (w) is non-degenerate, there are some 



2 The two-dimensional case is almost an exception. In 1211 . Varchenko proves that for an 
analytic function with an isolated minimum, there always exists what he calls adapted coordinates, 
in which the leading power of t is equal to the remoteness of the Newton diagram. 
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positive constants 8 and C such that 

W* [B s {w)} _ I Bs (w) H (x,z)H (z, y)e- 9 ^' t dz 
[B s (w)} ~ f BsM H (x,z)H (z,y)e-B(*)/tdz 
> C for all sufficiently small t. 

This follows from the fact that, by multiplying g on B§(w) by some positive con- 
stant, g on Bs(w) can be made smaller than g on Bs(w) (where this comparison can 
be made by introducing normal coordinates on each of these balls), and, by further 
reducing this constant, any problem arising from the Hq or the volume form can 
be overcome. Since 8 can be chosen such that B$(w) and B$(w) are disjoint, this 
inequality shows that no limiting measure can be supported only at w. We have 
shown that if ^J 2 h x ^ y {w) is non-degenerate, then no limit measure can be the point 
mass at w. This proves the lemma. □ 

3.7. A characterization of the cut locus. The results of the previous section 
show that n can be a point mass, even though there are multiple minimal geodesies 
from x to y. For example, consider the case when there are two minimal geodesies 
from x to y, one of which is conjugate and one of which is not. Then the limiting 
measure will be a point mass at the middle of the conjugate geodesic, and the 
1/t term will vanish. (In terms of the Brownian bridge, if we require the particle 
to travel from x to y "instantaneously," then with probability 1 it travels along 
the conjugate geodesic.) Thus, it is clear that the 1/t term in the asymptotics of 
W 2 A A E t (x,y) is not sufficient to determine when y € Cut(x). On the other hand, 
we have the following result characterizing Cut (a;) in terms of the asymptotics of 
V 2 E t (x,y). 

Theorem 10. Let M be a compact, smooth Riemannian manifold, and let x and 
y be any two distinct points of M . We have that y ^ Gut(x) if and only if 

\imV 2 E t (x,y) =W 2 E(x,y) 

and y £ Cut(x) if and only if 

limsup || \7 2 E t (x, y)\\ = oo 
t\o 

where || ~S/ 2 E t (x, y)\\ is the operator norm, that is, the max of \V\ A E t (x,y)\ over 
all A S T y M with unit length. Further, if M is real-analytic, we have the stronger 
result that y € Cut(x) if and only if 

lim||V 2 ^t(a;,y)|| = oo. 

Proof. The case where y Cut(x) is just a restatement of the result of Stroock 
and Malliavin mentioned in Equation @. So we consider the case when y € Cut (a;). 
First suppose that the set of accumulation points of \x t contains at least one measure 
fi which is not a point mass. Let ti be an associated sequence of times such that 
fj,ti — * t 1 - Because /x is not a point mass and the support of n is disjoint from Cut(zy), 
there is some unit vector A £ T y M such that V aE{z, y) is not almost surely fi- 
constant. Hence, for this A, we will have a nonzero variance and so V 2 A A logp^ (x, y) 
will blow up like 1 jt\ . 

Next, suppose that all of the accumulation points of m are point masses. This 
will be the situation for the remainder of the proof. Take any w such that the point 
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mass at w is an accumulation point of /it, and let t, be an associated sequence of 
times (that is, the U are a sequence of times decreasing monotonely to zero such 
that \i ti converges to a point mass at to). Take a small ball Bg(w) around w, and 
let x\,...,x n be any smooth coordinates around w defined on this ball such that 
d Xl , ■ ■ ■ , d Xn form an orthonormal basis for T W M. Then we wish to consider the 
function 

V(U) = max Var^* {xA 

./• :i <•:• 

where here we consider the Xj to be equal to zero outside of Bg(w). 

Define p w j to be equal to 2 \h x ,y ~ h x , y (w)] on B$(w) and zero elsewhere, and 
let tt w j{s) be the measure (with respect to Riemannian volume) of the subset of 
B$(w) where p w $ is less than s. Note that Hq{x,-)Hq{-,d) is bounded above and 
below by positive constants on r e . In particular, this means that, for the purposes 
of integration, we can absorb this factor into the volume form. Doing so greatly 
simplifies the formulas, and thus we will assume this is the case for the remainder 
of the proof. As a result, for the remainder of the proof we let Z(t) be the integral 
of exp [— 2{h XyV — h x ,y(w))/t] over T e . (Note that Z(t) is defined in terms of the 
integral over all of T e , instead of restricting to Bg(w).) Finally, we set 



fu(xi 



,) = exp [-p Wt s(xi, . . . , x n )/U] /Z{U) 



on Bg(w) and equal to zero elsewhere, so that f ti is the density of pt t with respect 
to Riemannian volume on B$(w). (Technically, ft t also depends on w and S, but 
to include this in the notation would be unmanageable.) 

We wish to estimate V(ti) in terms of the measure of the set where is at least 
half of its maximum value, /^(O, ... ,0) = \ jZ{ti). Call this set Si/ 2 (U). First, we 
observe that the measure of Si/ 2 (ti) is equal to Tr w j(U log2). This follows from 
noting that 



{x\ , . . . , Xn) . ft (xi , . . . , X n 



= { 



{x\ , . . . , x n ) : 



exp [p Wt s(xi, 



,0)} 

• 7 X n )] 



> 



Z(t) - 2Z(t) 

= {(xi, ...,X n ) ■ Pw,d(xi, ...,£„)< tl0g2} . 



Next, we have that 



V(U) > max 
je{i,...,n} 



in / ( 



mm 

ae 



! o/*i(°) vo1m(xi, . . . ,x n )dx\ ■ ■ ■ dx n 



Because the Xj are smooth and orthonormal at w, we can choose d small enough so 
that the ratio of the Euclidean volume form determined by the Xj to the Riemannian 
volume form (which, we recall, includes the factor H (x, -)H n (-, y)) is bounded from 
below by 1/VD and from above by \J~D on Bs(w), for some positive constant D. 
Assuming that this is the case, we have that 



V(ti)> max k(0) 

j£{l,...,n} D 



(xj — a) 2 dxi ■ ■ ■ dx n 



and that the measure of Si^iU) with respect to the Euclidean volume induced by 
the Xj (which we're now integrating against) is at least ns(ti log2)/2. 
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Now suppose we take the Steiner symmetrization (see |17| for the definition and 
basic properties of Steiner symmetrization) of Si/2{U) with respect to x\. Then it's 
clear that the above integral with respect to x\ is now minimized by taking a = 
and that it can only have decreased after the symmetrization, while the integral 
with respect to any other coordinate remains unchanged. Hence, if we symmetrize 
with respect to all of the coordinates and call the resulting set Sym(S'i/2(ti)), we 
see that 



V(ti) > max 7^/ti(0) / x 2 dx\ ■ ■ ■ dx Tl 

Jsym(S 1/2 (U)) 



1 

je{i~~,n} D 

Summing over j and writing r = x\ + • • • + we have that 

V(U) > -j^-ftM I r 2 d Xl ---dx n . 

Un JSym(S 1/2 (ti)) 

We recall that Steiner symmetrization preserves the Lebesgue measure of sets, and 
thus the above integral is being taken over a set of measure at least ir Wt s(ti log2)/2. 
On the other hand, it's clear that for a given measure for Sym(S , 1 / 2 (ii))i this integral 
is minimized when Sym(S l 1 / 2 (ii)) i s the ball of the appropriate measure centered at 
the origin. Computing the integral of r 2 over such a ball, we conclude that 

(11) V(U) > - ^fZ'K^ K^(Mog2))^ f u (Q). 

Note that Equation is valid (possibly with different constants) for any w with 
associated times U (meaning that p ti converges to a point mass at w) and smooth 
coordinates Xj which are orthonormal at w, once we choose small enough S and 
small enough ti. 

Our next task is to compare the asymptotics of ir Wt s(ti log 2) and /t ; (0) = 1/Z(ti) 
as ti \ 0. Let ir(s) be the analogue of tv Wi s(s) over all of T e . Then for any large 
enough a > 0, we have (up to exponentially small error, which we ignore) 

Z(t) = TT(a)e- a ^ + - f 7r(s)e- s /* ds. 
t Jo 

Choose any (3 > 0. Then for small enough t, 

-, pa i p-ftt\ogt -, pa 

- Tr(s)e' s/t ds = - / n{s)e- s / t ds + - Tr(s)e' s/t ds 

t J0 t Jo t J-/3tlogt 

< n(-pt\ogt) (l - t p ) + n(a) (t - e" Q /*) . 
It follows that, for small enough t, we have 

(12) Z(t) < -7r(-/3tlogt) + n{a)t 13 . 

Next, we claim that 7r(s) is bounded from above and below by positive powers 
of s (times a constant) for small enough s. To see that, first note that p is bounded 
from above by some non-negative function with a single zero and a non-degenerate 
Hessian at that zero, because the Hessian of p over all of T e is controlled from 
above by compactness. The volume of sublevel sets of this comparison function is 
easily seen to be asymptotic to a positive power of s, and thus ir(s) is bounded 
from below by this power of s. Next, we observe that the Hessian of p is always 
non-degenerate in the radial direction (in polar coordinates around x) and that the 
second derivative in this direction is bounded from below by a positive constant, 
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again by compactness. Hence p is bounded from below by some quadratic function 
of the radius alone. This comparison function also has the volume of its sublevel 
sets asymptotic to a positive power of s, and this provides an upper bound for 7r(s). 

Now let a be the infimum of all positive reals b such that, for some constant c 
and small enough t, ct b < ir(t). Choose any small e > 0. Then there must be some 
sequence of times tj \ such that 

T a-e \ 

Let the f3 from equation ^| be given by /3 = a — e, and let Tj be a sequence such 
that Ti = —(Sfi logTi for all i. Then, applying equation (|12J) . we have that 

Zin) < n(n) + 7r(a)rf 

which implies that, for some constant C, we have 
(13) > 



Zfa) ~ CrTI log Til"-*' 

By compactness, we can assume, after passing to a subsequence, that p B f i converges 
to some limit measure, which must be a point mass. Thus, we can take U = Ti. 

In the remainder of the proof, we will assume that C is some appropriate con- 
stant, the exact value of which may vary from appearance to appearance. Using 
that ti = Ti and applying Equations (jl 1|> and we have that 

v{u)>c* wAul0g2) ^ 



tr e \iogu\ a -* 

By the same argument as before, we know that ir(t log 2) is the measure of the subset 
of T e where the density of /i t is at least half of its maximum value. Then, because 
fi ti converges to a point mass at w, we must have that 2ir w } $(ti log 2) > 7r(^ log 2) 
for small enough fj. Thus ir Wt g(ti log 2) > Ctf* +e (for some potentially different 
constant C), and we conclude that 

V(U) > C f._- ^(t,lo g 2) 2 /". 
\\ogU\ a e 

Finally, it remains to estimate the asymptotics of n Wl s(t). Because we're assum- 
ing that pa converges to a point mass at w, Lemma implies that the Hessian of 
p must be degenerate at w. Since the Hessian of p is degenerate at w, its Newton 
diagram must be dominated by the diagram of x\ + X)™=2 x i ' possibly after rela- 
beling the coordinates. Hence our discussion of Laplace asymptotics (in particular, 
Theorem [8| implies that ir Wt s(t) > Ct%~ Dn where D n is some positive constant 
depending only on the dimension n of M . If we choose e < D n /2n, then we see 
that 



V(U) > c 



|log^| a " ? 

This proves that there is some point w and an associated sequence of times ti such 
that, for any smooth coordinates Xj orthonormal at u>, at least one of the Xj has 
variance that goes to zero slower than U. 

Because y ^ Cut(w), we can choose vectors A\,...,A n in T y M such that the 
functions Va,-E(z,2/) — VAj-E(w, are smooth coordinates around w which are 
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orthonormal at w. Thus the above argument applies, and we conclude that for 
some Aj, the variance of WA j E(z,y) — WA j E(w,y) goes to zero slower than ti. 
Normalize this Aj to have length one, and call the result A. Then \7aE(z, y) 
differs from V AjE{z,y) — \7 AjE(w,y) only by an afSne transformation, and thus 
its variance differs only by multiplication by some (positive) constant. We conclude 
that V AE{z,y) has variance that goes to zero slower than t{, and this completes 
the proof of the theorem for smooth manifolds. 

In the case when M is real-analytic, the proof follows the same lines, but is 
somewhat simpler. First of all, we know that the limit measure [i is unique. If it is 
not a point mass the result follows just as before, only now we don't have to pass 
to a subsequence of time and thus our result holds for the limit, not just the limit 
supremum. 

Now suppose that fi t converges to point mass at some point w. Then we have, 
using Equation Ijllfl and noting that we don't need to pass to a subsequence, 

V ^ ^ : ? ATe ^V\^ M^ g 2))^ / t (o). 

Again, the point is to compare the asymptotics of ir$ (Hog 2) and /t(0) = \/Z(t). 
We observe that 1/Z is essentially determined by the Laplace transform of its (see 
[2*2*] for a discussion of this fact). So we are comparing a function with its Laplace 
transform. In the proof of Theorem [7| we showed that Z{t) ~ ct a | log t\ m for 
some c > 0, some non-negative rational number a, and some non-negative integer 
to. Direct computation shows that any function of this form is asymptotically 
equivalent (up to multiplication by some positive real number a) to the reciprocal 
of its Laplace transform. (That this is true for real-analytic functions but not 
necessarily for smooth function is one reason for the increased difficulty in that 
case.) Thus we have that 

Vit) ~ a 2f , ovo h*> K>.'(* lQ g 2 ))" • 

Again, we know that ir Wt s(t) > Ct%~ Dn . From here, the theorem follows just as 
above. □ 

3.8. Lower order leading terms. Theoreml 1 01 shows that the Hessian of E t (x, y) 
always blows up on the cut locus. However, we've already seen that, if the limiting 
measure is a point mass, there will be no 1/t term in the expansion. In other words, 
the variance will go to zero, but it will to do more slowly than t. In this case, it is 
more difficult to determine the leading term of the expansion of V 2 E t (x,y), since 
it involves further terms in the Laplace asymptotics. Because of this, we can only 
discuss the simplest case. 

We consider the case where T consists of a single point, z\, such that g can be di- 
agonalized in a neighborhood of z\ . Recall this means that there exists coordinates 
Uj around z\ such that 



g(ux, ...,«„) =X/ 
i=i 

for some positive integers k\ < ■ ■ ■ < k n . Since we assume y £ Cut(x), this geodesic 
must be conjugate and thus k n > 2. Further, since the Hessian of h x>y is non- 
degenerate in the radial direction, k\ = 1. The advantage to this situation is that 
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we have a full asymptotic expansion from Equation 0, rather than just the leading 
term (at least that's the advantage of assuming diagonalizability; assuming that T 
contains just a single point makes the computation tractable). Let I be the smallest 
index such that ki — k n . (Note that we allow I = n, but we must have I > 2.) Thus 
ui, . . . ,u n correspond to the directions of maximal conjugacy. Keeping the first two 
terms of the expansion in Equation © , we see that 



<£>(tti, . . . , u„)exp 



Era 2kj " 



3 



t 



du\ ■ ■ ■ du n 



r(i/2fei) ■ ■ ■ r(i/2fc„) 1 1/2fcl+ ... +1/2fc „ y(Q) 

k\ • • • h n 

^ r(i/2fc 1 ) ■ ■ ■ r(3/2fc 3 ) ■ ■ ■ r(i/2fc n ) , 1/2fcl+ ... 3/2fc ,... +1/2fc „ 9 2 n(m 

+ ^ fc 1 -..(2fc,)...fc„ ' d^/ {0) - 

We will be interested in the cases if — (WAE(z,y))*Ho(x,z)Ho(z,y)vol u (z), 
where * denotes 1 or 2 (corresponding to the expectation of the square and the 
square of the expectation which appear in the variance of V aE(z,u)), and (p = 
Hq(x, z)Hq(z, y) vo\ u (z) (which gives the expansion of Z(t)). Dividing the relevant 
expansions and keeping the first two terms shows that (here we use that kj = k n 
for all j = I, . . . , n) 



[(V A E(z, y))*} ~(V A E(0, y))* + W^. ^ £ 



mr(VA E(0, ytfjjL (H (x, 0)H (Q, y) vol„(Q)) 



H (x,0)H {0,y)vo\ u (0) 
If we now compute the variance, most of this cancels, and we see that 



Finally, we compute that 



n , s i r(3/2fc„) ^ { d 

3 — ' 

There are several things to observe regarding this formula. First of all, it shows 
that it's easy to produce situations in which A E t (x,y) blows up at a rate in- 
termediate between 1/t and 1, and that every rational of the form — (m — \)/m 
for a positive integer m can be achieved as the order of the leading term. This 
gives concrete intuition to the results of Theorem ^3 Second, in the case of a 
single minimal geodesic, we see that knowing the order of the leading term tells 
us the maximum order of degeneracy of the Hessian of h XtV and thus also "how 
conjugate" this minimal geodesic is, at least in the "most conjugate" directions. 
Finally, it's relatively easy to see that the coefficient of this leading term also has 
geometric significance. In particular, as a function of A E T y M, it is a symmetric, 
non-positive definite quadratic form such that the dimension of its kernel is equal 
to n — I. To see this, let j(z) be the minimal geodesic from z £ B e (zi) to y and let 
vj e T y M be the derivative of the unit tangent to 7(21) at y with respect to d/duj. 
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Then d/duj\/A dist(zi, y) is non-zero if and only if (vj,A) is non-zero. Because 
h x ,y is non-degenerate in the radial direction, we can assume that the d/duj (for 
j = I, - ■ ■ ,n) are perpendicular to 7(^1), and thus we see that 

d d 
-—S7 A E{0, y) = dist(zi, y)- — Va dist(zi, y) for j =/,... , n. 
ouj ouj 

Observing that, for j = I, - ■ ■ ,n, the Vj are linearly independent, the desired result 
follows. Thus, the leading term not only gives us the maximum order of degeneracy 
of the Hessian of h X:V , but also tells us the dimension of the subspace on which this 
maximum degeneracy is achieved. 

We find it interesting that, at least in this case, a term in the expansion further 
down than \jt has such a nice geometric interpretation. Unfortunately, these terms 
are hard to compute in general (the present case of a single minimal geodesic is the 
easiest case, yet even here completely working out the above calculations is rather 
laborious), and we don't know anything more about them. 



In the last section, we studied the asymptotics relative to fixed points x and y. 
Now, we turn our attention to considering how Corollary[S]can be used to study the 
distributional Hessian of E(x, y), where we think of a: as a fixed base point and thus 
of E(x,y) as a function of y. In particular, Varadhan's result (see Equation 
implies that E t (x, y) is a smooth mollifier of E{x, y) as t \ 0. Hence, computing the 
distributional limit of A E t (x, y) as t \ gives \7 AA E(x, y) as a distribution. 

We note that the approach below is not the only way to study the distributional 
Hessian of E{x,y). For example, many of these results are consequences of the 
fact that the cut locus is rectifiable with respect to (n — l)-dimensional Hausdorff 
measure, as proven by Mennucci |13| using viscosity solutions of Hamilton- Jacobi 
equations and geometric measure theory. 

4.1. The results. Away from Cut(x), the distribution y) is just a smooth 

function, and Equation (J2J) tells us that V\ A E t (x, y) converges to this limit uni- 
formly on compact subsets of M \ Cut(x). This means that the singular part of 
V AA E(x,y), which we denote s'mg(V A A E(x, y)), is supported on Cut(x). Con- 
sidering Theorem we see that any contribution to the singular part must come 
from the variance term. In particular, for any smooth function ip we have 



where fit, y is the measure /j,t from above corresponding to the point y, the variance 
of VAE(-,y) is taken with respect to the first variable, and we use the notation 
(ip, D) to denote the action of the distribution D on the smooth function ip. 

Let (r, 9) £ R + x §" _1 be (normal) polar coordinates around x, and let d{9) be the 
distance to the cut locus along the geodesic corresponding to 9. Let U x = {(r, 9) : 
9 £ § ra_1 ,7- e (d(9) — e, d(9))}. Then the exponential map gives a diffeomorphism 
from U x to M \ Cut (a;), and dU x = (d(9), 9) is the tangential cut locus, that is, the 
preimage of Cut (a;) under the exponential map (or more accurately, the connected 
component of the preimage closest to the origin) . This gives a natural identification 
of §" _1 with the set of minimal geodesies from x to Cut(x) and with dU x . We will 
frequently assume this identification, for example, when stating that some 9 S § n_1 
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corresponds to a conjugate geodesic. Because Cut(x) has measure zero, we can write 
the above integral in polar coordinates on U x as 



We are now in a position to state the following theorem. 

Theorem 11. Let M be a smooth, compact Riemannian manifold and let x be any 
point in M. Let A be any smooth vector field on M. Choose (normal) polar coor- 
dinates on T X M and define U x as above. Then the right-hand side of equation i|14|) 
defines a negative measure on dU x , which is absolutely continuous with respect to 
the measure d9 on dll x obtained by identifying it with S n_1 via polar coordinates. 
Denote the corresponding Radon-Nikodym derivative by p(9); then p{9) is bounded. 
Thought of as a distribution on M , A E(x,y)) has as its singular part a negative 
measure v x ,a supported on Cut(a;), and further, v x ,A is given by the pushforward 
of p{9) d6 under the exponential map. 

While Theoremll II shows that the singular part of V 2 A A E(x, y)) has a relatively 
nice structure, it says little about the relationship between p and the geodesic 
geometry of M . In order to describe the relationship, we will need a bit more 
notation. Let C C S™ -1 be the set of all 8 which correspond to conjugate geodesies. 
Next, say that the geodesies corresponding to 9 and 9 are associated if they lead 
to the same point in Cut(x) (that is, if d{9) = d{9) and (d(9),9) and (d(9),9) are 
mapped to the same point under exp^,). Let P C § n_1 be the set of 9 € § n_1 \ C to 
which there is associated precisely one other 9 £ §™ _1 and such that 9 £" C. Finally, 
let R = § n_1 \ (CUP) (so R consists of non-conjugate 9 which are associated to 
more than one other geodesic or which are associated to a conjugate geodesic). The 
three sets C, P and R are disjoint and partition S" _1 . 

Theorem 12. Let the hypotheses be as in Theorem \ 111 If 9 £ C , then p{9) = 0. 
Also, R has measure zero as a subset o/§" _1 with respect to d9, and p is continuous 
except possibly at points of R. Finally, there is an explicit expression for p on P 
(see Equation (|15|) below). 

In order to give the explicit expression for p on P, we will need still more notation. 
Let 9 be in P, let 9 be the (one) associated geodesic, and let y be their common 
endpoint. Also, let z be the midpoint of the geodesic corresponding to 9, and let 
B be the (non-degenerate) Hessian of h XtV at z. Let z and B be the corresponding 
objects associated to 9. Next, let A y £ T y M be the value of the vector field A 
at y. Then let ip be the angle between the geodesic given by 9 and A y , ip be the 
angle between the geodesic corresponding to 9 and Ay, and tp the angle between 
the geodesies 9 and 9. Then for any 9 e P, 



(14) (<p, S mg(V% A E(x,y))) 




(15) p{9) 
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Note that the volume element, all of the functions Hq appearing above, and both 
B and B can be computed from the Jacobi fields along the geodesies given by 9 
and 9. 

Theorem 1 1 21 tells us that the only contributions to the singular part come from 
points in P, which on M are places where locally the cut locus looks like a smooth 
hypersurface and the singular part of W 2 E(x,y) is just given by the jump dis- 
continuity of VE(x, y) across this hypersurface. While the cut locus itself can be 
quite complicated (for example, it may not be triangulable, as shown by Gluck and 
Singer |S|), the singular part of W 2 E(x,y) is supported only at those points with 
the nicest local structure. 

We should point out, however, that even though there may not by any singular 
part of s \7 2 E(x, y) in a neighborhood of a conjugate point, the Hessian will not be 
smooth at a conjugate point. To be precise, suppose that 7 : [0, dist(x, y)] 1— > M 
is a minimal geodesic (with unit speed parametrization) from x to y. Then if we 
consider V 2 E(x, 7(s)) as s f dist(x,y), we have that the Hessian, as an operator 
on vectors fields A, will blow up if and only if 7 is a conjugate geodesic. Further, 
whether or not this blow up occurs for a given A depends on how A relates to the 
directions in which 7 is conjugate (we give the precise formula in the next section). 

This leads us to the following picture. The distributional Hessian of E(x, y) 
is composed of two pieces, an L 1 function, which is just the regular Hessian on 
M \ Cut(x), and a singular part, which is the measure supported on Cut(x) as 
described above. At a point y which is the image on M of a point in P, the 
L 1 part stays bounded as we approach y along either minimal geodesic, but the 
singular measure is supported at y, as discussed above. On the other hand, if y is 
a conjugate point, then the singular measure may not be supported at y, but the 
L 1 part will blow up as we approach y along any conjugate geodesic. Because the 
set R has measure zero in the decomposition of the tangential cut locus, these two 
cases completely describe the non-smooth behavior of the distributional Hessian of 



4.2. The proofs. The proof of Theorcmsllllandll2lwill require a little preparation. 
The first thing we need to do is to justify exchanging the integration with the 
limits in Equation (|14|l . Equation l|14|l immediately implies that the singular part 
is non-positive, and thus is a non-positive measure (as opposed to a higher order 
distribution). Because the singular part is a non-positive measure, we can estimate 
the Hessian in terms of the Laplacian. In particular, choose any closed, connected 
set ft C S™ -1 the boundary of which has finite (n — 2)-dimensional Hausdorff 
measure, and let Q(e) be the set 



E(x,y). 



{(r, 6) : 9 € O and r e \d{9) - e, d(9)]} . 



Then we have 




<p(r,9)V% A E t (x,y)vol(r,9)drd9 < dM^I 



2 
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for small enough t and e and some constant C\. Here |/|oo denotes the L°° norm 
of / on the set f2(e). This last integral can be estimated as 

vol(r, 9) drde < e|fi| s „-i| vol(r, 0)|oo- 

Q(e) 

Let n(r, 9) be the outward pointing unit normal to f2(e). We use integration by 
parts to write 



/ AE t (x,y)vo\{r,9)drd9 = - [ (VE t (x, y), n(r, 0)) dV n] 

JO(e) Jdn{e) 



dTC M 



where T~Cl7 denotes (n — l)-dimensional Hausdorff measure relative to the Rie- 
mannian volume measure. The right-hand side is a smooth function of i, so the 
only question is what happens to it as t \ 0. On the "walls" of dfl(e), that is, the 
set 

W(Q(e)) = {(r, 6):6edn and r G (d{9) - e, d(8))} , 
the results of Stroock and Malliavin show that WE t (x,y) converges (pointwise) 
to VE(x,y) = rd r . Further, the Gauss lemma shows that, on this set, n(r,9) is 
perpendicular to d r (both in the Riemannian metric and the Euclidean metric). 
Thus, the integral over the "walls" goes to zero with t, which we will write 

{VE t (x,y),n(r,6)) dU n M x = o(l). 

Next, we consider the integral over the "top" and "bottom" of dft(e), that is, the 
sets 

T(ft(e)) = {(r, 9):6edn and r = d(6)} 
and B(fi(e)) = {(r, 9) : 9 e d£l and r = d{9) - e} . 

The results of Stroock and Turetsky show that the norm of VE t (x, y) is bounded for 
all small t, and thus (X7E t (x,y),n(r,9)) is bounded on both T(f2(e)) and B(fl(e)). 
Thus it remains only to control the tC^ 1 measure of T(f2(e)) and B(fl(e)). 

First note that the volume density vol(r, 9) is bounded, and thus we can estimate 
the tClf 1 measure of a set from above by | vol(r, 0)|oo times the r H n ~ x measure of 
the set, where we use 7i™ -1 to indicate the (n — l)-dimensional Hausdorff measure 
relative to the Euclidean volume on the tangent space. The key to estimating this 
measure is the result of Itoh and Tanaka ^I] that the distance to the cut locus 
(that is, the function d(9)) is Lipschitz. It follows that 

rC 1 - 1 (T(n(e)) U B(n(e))) < 2 (1 + Lip(d)) |fi]s»-i |d|oo. 
Combining the above results, we conclude that 



(16) 



ip{r 1 9)V 2 AA E t {x,y)vo\{r, 9) dr 

d(8)-e 



d9 



< 



C 2 \v\oo\A\ 2 \ voKr.ejUlfilsn-i [(1 + Lip(d)) \d\ x + o(l) + 



for small enough t and e and some constant Ci. 

This justifies exchanging the limit as t \ with the integral over 9 in equation 
(|14Jl . In addition, because the variance and volume density are always non- negative 
and f is smooth, we can also exchange the limit as e \ with the integral over 
9. This shows that the measure given by the right-hand side of equation lfH[|l is 
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absolutely continuous with respect to the measure d9 and that its Radon-Nikodym 
derivative p is given by 

,d(fl) 4 

(17) p{9) = - lim lim / - Var^ ^ 8 ' {V A E{z, (r, 0))) vol(r, 0) dr. 
c\ot\oJ d{g) _ £ t 

We now turn our attention to the sets C, R, and P. On the set C, we want to 
show that p = 0. To do so, we won't work directly with l|14|). but rather with the 
definition of the distributional Hessian. In particular, choose any 9 € C, and let 
£7,5 C S™ -1 be a disk of radius 6 around 9. Then starting from (|16fl and letting t 
and e both go to zero, we see that 

PW dip < C 2 \A\ 2 \ vol(d(V0, ^)|oo|fi tf | s »-i (1 + Lip(d)) |d|oo- 

Everything on the right-hand side is bounded from above, so if we divide both sides 
by |fiff|gn-i and let S go to zero, then Lebesgue's differentiation theorem tells us 
that —p(9) is almost everywhere on C less than or equal to a constant times the 
limit as S \ of the L°°-norm of vol(d(t/j),ip) over fig. Because vol(d(ip) , ip) is 
continuous and equals zero at 9 (because the corresponding geodesic is conjugate), 
this shows that p is equal to zero almost everywhere on C . Because p is a density, 
its value only matters up to almost everywhere equivalence, and thus we can take 
p to be zero on all of C. 

Next, we prove a lemma which shows that when computing p, we can ignore the 
set R. 

Lemma 13. For any compact manifold M and a basepoint x € M , the correspond- 
ing set R, as defined above, has measure zero as a subset o/§™ -1 with its standard 
volume measure. 

Proof. As usual, we identify S" _1 with the set of minimal geodesies from x to 
Cut(x). Let S\ be set set of non-conjugate 9 which are associated to more than one 
other geodesies, all of which are non-conjugate. Let S 2 be the set of non-conjugate 
9 which are associated to a conjugate geodesic (and possibly to other geodesies as 
well). Then R= Si U S 2 - 

We first consider S\ . In particular, choose any 9 6 S\ and let and let 9\, ... ,9k 
for some k > 2 be the associated geodesies (there are necessarily only finitely many 
because otherwise they would have an accumulation point, forcing at least one to be 
conjugate). Let y be their common endpoint. In terms of the exponential map, the 
fact that 9 is not conjugate means that there is a neighborhood U of (d(9, 9) in T X M 
which is diffeomorphic to a neighborhood V of y under the exponential map. The 
same is true for each of the 9i and we use Ui for the corresponding subsets of T X M. 
By choosing these neighborhoods small enough, we can assume that the preimage 
of V under the exponential map is precisely equal to UU Ui U • • • Uf/fc. Now we can 
define a smooth function / on V to be the length of the corresponding element of the 
tangent space in U, and we can similarly define the We will call such functions 
local distance functions. It follows that, for any point in V, the Riemannian distance 
is given by min{/, /1, • ■ • ,/fe} and further, that the number of minimal geodesies to 
that point is given by the number of these local distance functions which achieve 
this minimum. At y, we know that all of these local distance functions achieve 
their common minimum, that they all have gradient of length one, and that these 
gradient vectors are all distinct. Given this, it is straight-forward to see that the set 
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of points in V where at least three of the these local distance functions achieve the 
common minimum is given by a finite union of smooth submanifolds of dimension 
no more than n — 2. Since V and U are diffeomorphic under the exponential 
map, it follows that the set of points in U which correspond to points with more 
than two minimal geodesies has (n — l)-dimensional Hausdorff measure equal to 
zero. Projecting this set onto the 9 coordinate gives the intersection of S\ with a 
neighborhood of 9 7 and we conclude that this set has measure zero. This shows 
that every 9 € Si has a neighborhood in which Si has measure zero, and it follows 
that all of Si has measure zero. 

We now consider S2 ■ This will require some facts about the set of points in M 
which are conjugate to x along a minimal geodesic; call this set the conjugate-cut 
locus. Observe that the set of points in M corresponding to S2 is contained in the 
conjugate-cut locus. Let C\ be the set of all vectors in T X M such that the kernel 
of dexp x has dimension 1. In Warner showed that Ci is a smooth (n — 1)- 
dimensional submanifold of T X M. Let T be the set of points in Ci such that the 
kernel of dexp^. is contained in the tangent space to Ci at that point. Let H be the 
set of all vectors in T X M such that the kernel of d exp x has dimension at least two. In 
the proof of Lemma 1.1 of |24| . Warner showed that the image under the exponential 
map of TUH has (n — l)-dimcnsional Hausdorff measure equal to zero. In the proof 
of Proposition 3.2 of [S], Hebda proved that a point of Ci \ T cannot correspond to 
a minimal geodesic. This means that the conjugate-cut locus is precisely the image 
of TU H under exp^, and thus has (n — l)-dimcnsional Hausdorff measure equal to 
zero. Now choose any 9 £ S2 and let y be the corresponding point in M. As before, 
we can choose neighborhoods U and V of (d(9), 9) and y such that the exponential 
map gives a diffeomorphism between them. In particular, this means that there is a 
neighborhood W C §" _1 of 9 such that no two points in W correspond to the same 
point in M, Suppose S2 H W has positive measure. Then the corresponding set of 
points in U would have positive (n — l)-dimensional Hausdorff measure. But these 
points are contained in the conjugate-cut locus and so this contradicts the above. 
Hence we conclude that S2 H W has measure zero. Since this holds for any 9 € S%, 
it follows that all of £2 has measure zero. Given the decomposition R — Si U S%, 
this completes the proof of the lemma. □ 

In order to complete the proofs of Theorems El an d El it suffices to compute 
p from equation (|17J) on P. This is fairly straight-forward, if somewhat tedious. 
In this case, C e consists just of balls around z and z. We begin by estimating 
fliyM [/] f or ari y smooth test function / and small t and 5. First of all, let y$ be 
the image under the exponential map of (d(9) — 5,9). Then let zg be the midpoint 
of the geodesic from x to ys in the direction 9, and let zg be the midpoint of the 
(non-minimizing, if 8 > 0) geodesic from x to ys in the direction close to 9. Because 
the exponential map is a diffeomorphism near (d(9),9), it follows that z$ is well- 
defined for small enough 6 and depends smoothly on 5. We know that h Xt y s has 
non-degenerate Hessian at both zg and zg, and we denote these Hessians by Bg and 
Bg. An easy computation shows that the volume form associated to coordinates 
which diagonalize 2h x ^ ys around zg is l/vclet Bg, and similarly for zg; note that Bg 
is the Hessian of h x ,y s , without the factor of 2 (this is essentially a consequence of 
the 1/2 which appears in the second order Taylor expansion). Then Equation © 
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gives 



f(u)H (x, u)H(u, ys) exp 



2h x,y s {u) 



du = 



t n ' 2 exp 



t n ' 2 exp 



t 



H {x,z s )H {z s ,y s ) ' 

/( * } TraT — + ° ,(t) 

f(zs)- 



\fdet B s 



■O e (t) 



Here we use the notation Og(t) to indicate that the error term as t \ depends 
on 8, but does so uniformly for all sufficiently small 8; in particular, the integral of 
0${t) with respect to 8 is 0(t). 
Let 



F 



H(x,z)H(y,z)VdetB 
H(x,z)H(y,z)VdetB 



Also note that f(zs) = f(z) + 0(6), f(z§) = f(z) + 0(6), and so on for H , h, 
and B. In addition, we will need to know how 2h x ,y s (zs) compares with 2h Xiys (zs). 
Elementary trigonometry shows that this difference can be written as 

2h XiVs (zs) - 2h x . V5 (z s ) = dist(x,y)6(l - cosip) + 0(6 2 ). 

Then since Z(t) is obtained simply by taking f(u) = 1 in the above, we have that 

E"*.' [/] = 

f(z) + 0(5) + (f(z)F + 0(6)) exp dist(x, y)(l - cos <p) + 0(6 2 ))] 



1 + (F + 0(6)) exp [-i(<5dist(x, y)(l - cos tp) + 0(8 2 ))] 
Using this, we can compute (after doing some algebra) that 

Vav^ [V A E(-,y)] = (V A E(z, y) - V A E(S, y)) 2 (F + 0(8)) 
exp [~\(5A\st(x,y)(l -costp) + 0(S 2 ))] 
{1 + (F + 0(6)) exp [-i(5dist(x, y)(l - cos^) + 0(8 2 ))] } 2 



O s (t). 



O s (t). 



Writing vol(d(9) — 5,9) as vol(d(0),6) + 0(8) and making the change of variables 
a = —5dist(x,y)(l — cos <p)/t, Equation lfT7| gives 



p(9) = 



i(V A E(z,y)-V A E(z,y)Y 



vol(d{9),9)x 
dist(x, y)(l — cos tp) 

(F + O(to))e" +O » fa ) 2 ) 
{1 + (F + 0(ta)) e«+o((*«) 2 )} 5 



lim lim 

e\ot\o J J(£jt) 



Os(t) 



[1 + 0(ta)] da 



where I(e, t) is the interval [— edist(cc, y)(l — cos <p)/t, 0] . 

Taking both limits causes the region of integration to become (—00, 0]. Further, 
it causes all of the O(-) terms to vanish. To see this first note that 0$(t) vanishes 
uniformly with t. As for the 0(ta) and 0((ta) 2 ) terms, O(to) is bounded on I(e, t) 
for all e and t and goes to zero uniformly on any compact subinterval. Using this, 
one can show that they don't contribute in the limit (to see this in detail, one can 
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make the further change of variables (3 — Fe a and compute the integral). Thus, 
taking the limits, the above integral becomes 



Fe a 



da 



1 



I -co {1 + Fe a } 2 ~* 1 + 
Using this in the above expression for p, along with the fact that 

{V A E(z,y) - X7 A E(z,y)) 2 = - dist(x, y) 2 \A y \ 2 (cost/) - cos^ , 

gives Equation Q15[l. This completes the proofs of Theorems 1111 and 1121 

Finally, we justify our earlier comments about the L 1 part of the distributional 
Hessian along conjugate geodesies. We begin by applying Theorem0|at a point y £ 
Cut(x). Because y is not in the cut locus, there is a single, non-conjugate minimal 
geodesic between x and y. Because this geodesic is not conjugate, 'V 2 2h x , y (z) is 
non-degenerate, and we choose coordinates u±, . . . ,u n around z such that 2h x ^ y = 
u\ + ■ ■ ■ + u^. We already know that, since we are not on the cut locus, the leading 
term in the Hessian will be the constant term, and in order to compute this term 
we will need the first two terms in the expansion of Theorem 0] We begin by 
computing, using the first two terms of the expansion in Equation that 



(18) 



f{u) exp 



B t (z) 



exp 



+t 



2h x , y (u) 



k(t/2, x, u)k{t/2, y, u) du ■ 



^h x ^y{z^ 



r/ 2 {r(l/2)'7(z)fc(t/2, x, z)k(t/2, y, z) vo\ u {z) 



r(i/2) n ^ 1 r(3/2) 



A" (f(z)k(t/2, x, z)k(t/2, y, z) vo^z)) + 0{t 2 ) 



Here the symbol A" means the operator YH=i ~§u?- Recall that 

k(t, x, y) = H (x, y) + tHi (x, y) + 0(t 2 ) 
and l{t,x,y,A) = -V A E(x,y) +tV A G 1 [x,y) + 0{t 2 ). 

Given Equation l |18| l and the expansions of I and k in terms of the Hi and the 
Gi, expanding the right hand side of the equality in Theorem 0] becomes simply a 
lengthy exercise in manipulating Taylor series. We won't reproduce the computa- 
tion here and will instead merely state the result. The coefficient of the l/t term 
is zero, as we know it must be, and taking the limit as t \ gives 



V\ A E{x,y) = 2 



A.A 



E(z,y) 



»=i 



If we consider what happens as y approaches Cut(a;) along a geodesic 7, we see 
that V 2 E{x,y) will blow up, as an operator, if and only if 7 is conjugate. This is 
because both W 2 E(z,y) and WE(z,y) remain bounded, and thus the only way for 
a blow up to occur is if at least one of the d Ui has its (Riemannian) length blowing 
up. This occurs precisely if the corresponding eigenvalue of ~S7 2 h x . y {z) is going to 
zero, and thus precisely if 7 is conjugate in the direction corresponding to d Ui . It 
is also easy to see that the blow up must be in the negative direction, and that the 
relationship between a given vector A and the d Ui determines whether or not the 
Hessian blows up for a given A. 
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